Introduction
Comprehensive quality control (QC) of single-cell RNA-seq data was
performed with the singleCellTK
package. This report contains information about each QC tool and
visualization of the QC metrics for each sample. For more information on
running this pipeline and performing quality control, see the documentation.
If you use the singleCellTK package for quality control, please include
a reference
in your publication.
Summary Statistics
SCTK-QC
|
|
d8b737fb
|
8628f96c
|
e7372715
|
84824920
|
c47a5959
|
All Samples
|
|
Number of Cells
|
4114
|
2920
|
5464
|
3683
|
3804
|
19985
|
|
Mean counts
|
6792.4
|
2291
|
3494
|
3253.4
|
2280
|
3721.8
|
|
Median counts
|
6925.5
|
1520.5
|
2705.5
|
2272
|
1639
|
2581
|
|
Mean features detected
|
2533.1
|
1232.8
|
1651.8
|
1554.9
|
1219.8
|
1671.9
|
|
Median features detected
|
2670.5
|
1005
|
1526
|
1358
|
1038
|
1467
|
|
scDblFinder - Number of doublets
|
198
|
123
|
413
|
214
|
360
|
1308
|
|
scDblFinder - Percentage of doublets
|
4.81
|
4.21
|
7.56
|
5.81
|
9.46
|
6.54
|
|
DoubletFinder - Number of doublets, Resolution 1.5
|
309
|
219
|
410
|
276
|
285
|
1499
|
|
DoubletFinder - Percentage of doublets, Resolution 1.5
|
7.51
|
7.5
|
7.5
|
7.49
|
7.49
|
7.5
|
|
CXDS - Number of doublets
|
361
|
399
|
744
|
560
|
407
|
2471
|
|
CXDS - Percentage of doublets
|
8.77
|
13.7
|
13.6
|
15.2
|
10.7
|
12.4
|
|
BCDS - Number of doublets
|
158
|
252
|
371
|
438
|
240
|
1459
|
|
BCDS - Percentage of doublets
|
3.84
|
8.63
|
6.79
|
11.9
|
6.31
|
7.3
|
|
SCDS Hybrid - Number of doublets
|
195
|
374
|
443
|
523
|
380
|
1915
|
|
SCDS Hybrid - Percentage of doublets
|
4.74
|
12.8
|
8.11
|
14.2
|
9.99
|
9.58
|
|
DecontX - Mean contamination
|
0.0649
|
0.0626
|
0.104
|
0.115
|
0.125
|
0.0961
|
|
DecontX - Median contamination
|
0.0335
|
0.0314
|
0.0697
|
0.0677
|
0.08
|
0.0542
|
The summary statistics table summarizes QC metrics of the cell
matrix. This table summarizes the mean and median of UMI counts and
median of genes detected per cell, as well as the number and percentages
of doublets and estimated ambient RNA scores per dataset.
General quality control
metrics
SingleCellTK utilizes the scater
package to compute cell-level QC metrics. The wrapper function
runPerCellQC can be used to separately compute QC metrics
on its own. The wrapper function plotRunPerCellQCResults
can be used to plot the general QC outputs. The QC outputs are
sum, detected, and percent_top_X.
sum contains the total number of counts for each cell.
detected contains the total number of features for each
cell. percent_top_X contains the percentage of the total
counts that is made up by the expression of the top X genes for each
cell. The subsets_ columns contain information for the
specific gene list that was used. For instance, if a gene list
containing mitochondrial genes named mito was used,
subsets_mito_sum would contains the total number of
mitochondrial counts for each cell.
Total Counts

Total Features

Percentage of Library Size
Occupied by Top 50 Expressed Features

Parameters
|
useAssay
|
counts
|
|
collectionName
|
NULL
|
|
geneSetList
|
NULL
|
|
geneSetListLocation
|
rownames
|
|
mitoRef
|
NULL
|
|
mitoIDType
|
NULL
|
|
mitoPrefix
|
NULL
|
|
mitoID
|
NULL
|
|
mitoGeneLocation
|
NULL
|
|
percent_top
|
50 100 200 500
|
|
use_altexps
|
FALSE
|
|
flatten
|
TRUE
|
|
detectionLimit
|
0
|
|
packageVersion
|
1.22.1
|
In this function, the inSCE parameter is the input
SingleCellExperiment object, while the useAssay parameter
is the assay object that in the SingleCellExperiment object the user
wishes to use.
Doublet Detection
Doublet Detection
Summary
scDblFinder

Scds_Cxds

Scds_Bcds

Scds_Hybrid

doubletFinder_1.5

DoubletFinder
DoubletFinder is a doublet detection algorithm which depends on the
single cell analysis package Seurat.
The wrapper function runDoubletFinder can be used to
separately run the DoubletFinder algorithm on its own. The wrapper
function plotDoubletFinderResults can be used to plot the
QC outputs from the DoubletFinder algorithm. The DoubletFinder outputs
are doubletFinder_doublet_score, which is a numeric
variable of the likelihood that a cell is a doublet, and the
doubletFinder_doublet_label, which is the assignment of
whether the cell is a doublet.
d8b737fb
Resolution: 1.5
DoubletFinder Doublet
Assignment

DoubletFinder Doublet
Score

Density of Doublet
Score

Violin of Doublet
Score

Parameters
|
useAssay
|
counts
|
|
seed
|
12345
|
|
seuratNfeatures
|
2000
|
|
seuratPcs
|
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
|
|
seuratRes
|
1.5
|
|
formationRate
|
0.075
|
|
nCores
|
NULL
|
|
verbose
|
FALSE
|
|
packageVersion
|
2.0.2
|
runDoubletFinder relies on a parameter (in Seurat)
called resolution to determine cells that may be doublets. Users will be
able to manipulate the resolution parameter through
seuratRes. If multiple numeric vectors are stored in
seuratRes, there will be multiple label/scores. The
seuratNfeatures parameter determines the number of features
that is used in the FindVariableFeatures function in
Seurat. seuratPcs parameter determines the number of
dimensions used in the FindNeighbors function in Seurat.
The formationRate parameter is the estimated doublet
detection rate in the dataset. aims to detect doublets by creating
simulated doublets from combining transcriptomic profiles of existing
cells in the dataset.
8628f96c
Resolution: 1.5
DoubletFinder Doublet
Assignment

DoubletFinder Doublet
Score

Density of Doublet
Score

Violin of Doublet
Score

Parameters
|
useAssay
|
counts
|
|
seed
|
12345
|
|
seuratNfeatures
|
2000
|
|
seuratPcs
|
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
|
|
seuratRes
|
1.5
|
|
formationRate
|
0.075
|
|
nCores
|
NULL
|
|
verbose
|
FALSE
|
|
packageVersion
|
2.0.2
|
runDoubletFinder relies on a parameter (in Seurat)
called resolution to determine cells that may be doublets. Users will be
able to manipulate the resolution parameter through
seuratRes. If multiple numeric vectors are stored in
seuratRes, there will be multiple label/scores. The
seuratNfeatures parameter determines the number of features
that is used in the FindVariableFeatures function in
Seurat. seuratPcs parameter determines the number of
dimensions used in the FindNeighbors function in Seurat.
The formationRate parameter is the estimated doublet
detection rate in the dataset. aims to detect doublets by creating
simulated doublets from combining transcriptomic profiles of existing
cells in the dataset.
e7372715
Resolution: 1.5
DoubletFinder Doublet
Assignment

DoubletFinder Doublet
Score

Density of Doublet
Score

Violin of Doublet
Score

Parameters
|
useAssay
|
counts
|
|
seed
|
12345
|
|
seuratNfeatures
|
2000
|
|
seuratPcs
|
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
|
|
seuratRes
|
1.5
|
|
formationRate
|
0.075
|
|
nCores
|
NULL
|
|
verbose
|
FALSE
|
|
packageVersion
|
2.0.2
|
runDoubletFinder relies on a parameter (in Seurat)
called resolution to determine cells that may be doublets. Users will be
able to manipulate the resolution parameter through
seuratRes. If multiple numeric vectors are stored in
seuratRes, there will be multiple label/scores. The
seuratNfeatures parameter determines the number of features
that is used in the FindVariableFeatures function in
Seurat. seuratPcs parameter determines the number of
dimensions used in the FindNeighbors function in Seurat.
The formationRate parameter is the estimated doublet
detection rate in the dataset. aims to detect doublets by creating
simulated doublets from combining transcriptomic profiles of existing
cells in the dataset.
84824920
Resolution: 1.5
DoubletFinder Doublet
Assignment

DoubletFinder Doublet
Score

Density of Doublet
Score

Violin of Doublet
Score

Parameters
|
useAssay
|
counts
|
|
seed
|
12345
|
|
seuratNfeatures
|
2000
|
|
seuratPcs
|
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
|
|
seuratRes
|
1.5
|
|
formationRate
|
0.075
|
|
nCores
|
NULL
|
|
verbose
|
FALSE
|
|
packageVersion
|
2.0.2
|
runDoubletFinder relies on a parameter (in Seurat)
called resolution to determine cells that may be doublets. Users will be
able to manipulate the resolution parameter through
seuratRes. If multiple numeric vectors are stored in
seuratRes, there will be multiple label/scores. The
seuratNfeatures parameter determines the number of features
that is used in the FindVariableFeatures function in
Seurat. seuratPcs parameter determines the number of
dimensions used in the FindNeighbors function in Seurat.
The formationRate parameter is the estimated doublet
detection rate in the dataset. aims to detect doublets by creating
simulated doublets from combining transcriptomic profiles of existing
cells in the dataset.
c47a5959
Resolution: 1.5
DoubletFinder Doublet
Assignment

DoubletFinder Doublet
Score

Density of Doublet
Score

Violin of Doublet
Score

Parameters
|
useAssay
|
counts
|
|
seed
|
12345
|
|
seuratNfeatures
|
2000
|
|
seuratPcs
|
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
|
|
seuratRes
|
1.5
|
|
formationRate
|
0.075
|
|
nCores
|
NULL
|
|
verbose
|
FALSE
|
|
packageVersion
|
2.0.2
|
runDoubletFinder relies on a parameter (in Seurat)
called resolution to determine cells that may be doublets. Users will be
able to manipulate the resolution parameter through
seuratRes. If multiple numeric vectors are stored in
seuratRes, there will be multiple label/scores. The
seuratNfeatures parameter determines the number of features
that is used in the FindVariableFeatures function in
Seurat. seuratPcs parameter determines the number of
dimensions used in the FindNeighbors function in Seurat.
The formationRate parameter is the estimated doublet
detection rate in the dataset. aims to detect doublets by creating
simulated doublets from combining transcriptomic profiles of existing
cells in the dataset.
ScDblFinder
scDblFinder
is a doublet detection algorithm in the scran package.
scDblFinder aims to detect doublets by creating a simulated doublet from
existing cells and projecting it to the same PCA space as the cells. The
wrapper function runScDblFinder can be used to separately
run the scDblFinder algorithm on its own. The wrapper function
plotScDblFinderResults can be used to plot the QC outputs
from the scDblFinder algorithm. The output of scDblFinder is a
scDblFinder_doublet_score and
scDblFinder_doublet_call. The doublet score of a droplet
will be higher if the it is deemed likely to be a doublet.
d8b737fb
ScDblFinder Doublet
Assignment

ScDblFinder Doublet
Score

Density Score

Violin Score

Parameters
|
useAssay
|
counts
|
|
nNeighbors
|
50
|
|
simDoublets
|
19985
|
|
seed
|
12345
|
|
packageVersion
|
1.8.0
|
The nNeighbors parameter is the number of nearest
neighbor used to calculate the density for doublet detection.
simDoublets is used to determine the number of simulated
doublets used for doublet detection.
8628f96c
ScDblFinder Doublet
Assignment

ScDblFinder Doublet
Score

Density Score

Violin Score

Parameters
|
useAssay
|
counts
|
|
nNeighbors
|
50
|
|
simDoublets
|
19985
|
|
seed
|
12345
|
|
packageVersion
|
1.8.0
|
The nNeighbors parameter is the number of nearest
neighbor used to calculate the density for doublet detection.
simDoublets is used to determine the number of simulated
doublets used for doublet detection.
e7372715
ScDblFinder Doublet
Assignment

ScDblFinder Doublet
Score

Density Score

Violin Score

Parameters
|
useAssay
|
counts
|
|
nNeighbors
|
50
|
|
simDoublets
|
19985
|
|
seed
|
12345
|
|
packageVersion
|
1.8.0
|
The nNeighbors parameter is the number of nearest
neighbor used to calculate the density for doublet detection.
simDoublets is used to determine the number of simulated
doublets used for doublet detection.
84824920
ScDblFinder Doublet
Assignment

ScDblFinder Doublet
Score

Density Score

Violin Score

Parameters
|
useAssay
|
counts
|
|
nNeighbors
|
50
|
|
simDoublets
|
19985
|
|
seed
|
12345
|
|
packageVersion
|
1.8.0
|
The nNeighbors parameter is the number of nearest
neighbor used to calculate the density for doublet detection.
simDoublets is used to determine the number of simulated
doublets used for doublet detection.
c47a5959
ScDblFinder Doublet
Assignment

ScDblFinder Doublet
Score

Density Score

Violin Score

Parameters
|
useAssay
|
counts
|
|
nNeighbors
|
50
|
|
simDoublets
|
19985
|
|
seed
|
12345
|
|
packageVersion
|
1.8.0
|
The nNeighbors parameter is the number of nearest
neighbor used to calculate the density for doublet detection.
simDoublets is used to determine the number of simulated
doublets used for doublet detection.
Cxds
CXDS, or co-expression based doublet scoring, is an algorithm in the
SCDS
package which employs a binomial model for the co-expression of pairs of
genes to determine doublets. The wrapper function runCxds
can be used to separately run the CXDS algorithm on its own. The wrapper
function plotCxdsResults can be used to plot the QC outputs
from the CXDS algorithm. The output of runCxds is the doublet score,
scds_cxds_score.
d8b737fb
Cxds Doublet
Assignment

Cxds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
binThresh
|
0
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runCxds, the ntop parameter is the number of top
variance genes to consider. The binThresh parameter is the
minimum counts a gene needs to have to be included in the analysis.
verb determines whether progress messages will be displayed
or not. retRes will determine whether the gene pair results
should be returned or not. The user may set the estimated number of
doublets with estNdbl.
8628f96c
Cxds Doublet
Assignment

Cxds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
binThresh
|
0
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runCxds, the ntop parameter is the number of top
variance genes to consider. The binThresh parameter is the
minimum counts a gene needs to have to be included in the analysis.
verb determines whether progress messages will be displayed
or not. retRes will determine whether the gene pair results
should be returned or not. The user may set the estimated number of
doublets with estNdbl.
e7372715
Cxds Doublet
Assignment

Cxds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
binThresh
|
0
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runCxds, the ntop parameter is the number of top
variance genes to consider. The binThresh parameter is the
minimum counts a gene needs to have to be included in the analysis.
verb determines whether progress messages will be displayed
or not. retRes will determine whether the gene pair results
should be returned or not. The user may set the estimated number of
doublets with estNdbl.
84824920
Cxds Doublet
Assignment

Cxds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
binThresh
|
0
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runCxds, the ntop parameter is the number of top
variance genes to consider. The binThresh parameter is the
minimum counts a gene needs to have to be included in the analysis.
verb determines whether progress messages will be displayed
or not. retRes will determine whether the gene pair results
should be returned or not. The user may set the estimated number of
doublets with estNdbl.
c47a5959
Cxds Doublet
Assignment

Cxds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
binThresh
|
0
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runCxds, the ntop parameter is the number of top
variance genes to consider. The binThresh parameter is the
minimum counts a gene needs to have to be included in the analysis.
verb determines whether progress messages will be displayed
or not. retRes will determine whether the gene pair results
should be returned or not. The user may set the estimated number of
doublets with estNdbl.
Bcds
BCDS, or binary classification based doublet scoring, is an algorithm
in the SCDS
package which uses a binary classification approach to determine
doublets. The wrapper function runBcds can be used to
separately run the BCDS algorithm on its own. The wrapper function
plotBCDSResults can be used to plot the QC outputs from the
BCDS algorithm. The output of runBcds is scds_bcds_score,
which is the likelihood that a cell is a doublet.
d8b737fb
Bcds Doublet
Assignment

Bcds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
srat
|
1
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
nmax
|
tune
|
|
varImp
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runBcds, the ntop parameter is the number of top
variance genes to consider. The srat parameter is the ratio
between original number of cells and simulated doublets. The
nmax parameter is the maximum number of cycles that the
algorithm should run through. If set to tune, this will be
automatic. The varImp parameter determines if the variable
importance should be returned or not.
8628f96c
Bcds Doublet
Assignment

Bcds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
srat
|
1
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
nmax
|
tune
|
|
varImp
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runBcds, the ntop parameter is the number of top
variance genes to consider. The srat parameter is the ratio
between original number of cells and simulated doublets. The
nmax parameter is the maximum number of cycles that the
algorithm should run through. If set to tune, this will be
automatic. The varImp parameter determines if the variable
importance should be returned or not.
e7372715
Bcds Doublet
Assignment

Bcds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
srat
|
1
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
nmax
|
tune
|
|
varImp
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runBcds, the ntop parameter is the number of top
variance genes to consider. The srat parameter is the ratio
between original number of cells and simulated doublets. The
nmax parameter is the maximum number of cycles that the
algorithm should run through. If set to tune, this will be
automatic. The varImp parameter determines if the variable
importance should be returned or not.
84824920
Bcds Doublet
Assignment

Bcds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
srat
|
1
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
nmax
|
tune
|
|
varImp
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runBcds, the ntop parameter is the number of top
variance genes to consider. The srat parameter is the ratio
between original number of cells and simulated doublets. The
nmax parameter is the maximum number of cycles that the
algorithm should run through. If set to tune, this will be
automatic. The varImp parameter determines if the variable
importance should be returned or not.
c47a5959
Bcds Doublet
Assignment

Bcds Doublet Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
ntop
|
500
|
|
srat
|
1
|
|
verb
|
FALSE
|
|
retRes
|
FALSE
|
|
nmax
|
tune
|
|
varImp
|
FALSE
|
|
estNdbl
|
TRUE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
In runBcds, the ntop parameter is the number of top
variance genes to consider. The srat parameter is the ratio
between original number of cells and simulated doublets. The
nmax parameter is the maximum number of cycles that the
algorithm should run through. If set to tune, this will be
automatic. The varImp parameter determines if the variable
importance should be returned or not.
ScdsHybrid
The CXDS-BCDS hybrid algorithm, uses both CXDS and BCDS algorithms
from the SCDS
package. The wrapper function runCxdsBcdsHybrid can be used
to separately run the CXDS-BCDS hybrid algorithm on its own. The wrapper
function plotScdsHybridResults can be used to plot the QC
outputs from the CXDS-BCDS hybrid algorithm. The output of
runCxdsBcdsHybrid is the doublet score,
scds_hybrid_score.
d8b737fb
ScdsHybrid Doublet
Assignment

ScdsHybrid Doublet
Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
nTop
|
500
|
|
cxdsArgs
|
NULL
|
|
bcdsArgs
|
NULL
|
|
verb
|
FALSE
|
|
estNdbl
|
TRUE
|
|
force
|
FALSE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
All parameters from the runBCDS and runBCDS
functions may be applied to this function in the cxdsArgs
and bcdsArgs parameters, respectively.
8628f96c
ScdsHybrid Doublet
Assignment

ScdsHybrid Doublet
Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
nTop
|
500
|
|
cxdsArgs
|
NULL
|
|
bcdsArgs
|
NULL
|
|
verb
|
FALSE
|
|
estNdbl
|
TRUE
|
|
force
|
FALSE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
All parameters from the runBCDS and runBCDS
functions may be applied to this function in the cxdsArgs
and bcdsArgs parameters, respectively.
e7372715
ScdsHybrid Doublet
Assignment

ScdsHybrid Doublet
Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
nTop
|
500
|
|
cxdsArgs
|
NULL
|
|
bcdsArgs
|
NULL
|
|
verb
|
FALSE
|
|
estNdbl
|
TRUE
|
|
force
|
FALSE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
All parameters from the runBCDS and runBCDS
functions may be applied to this function in the cxdsArgs
and bcdsArgs parameters, respectively.
84824920
ScdsHybrid Doublet
Assignment

ScdsHybrid Doublet
Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
nTop
|
500
|
|
cxdsArgs
|
NULL
|
|
bcdsArgs
|
NULL
|
|
verb
|
FALSE
|
|
estNdbl
|
TRUE
|
|
force
|
FALSE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
All parameters from the runBCDS and runBCDS
functions may be applied to this function in the cxdsArgs
and bcdsArgs parameters, respectively.
c47a5959
ScdsHybrid Doublet
Assignment

ScdsHybrid Doublet
Score

Density Score

Violin Score

Parameters
|
seed
|
12345
|
|
nTop
|
500
|
|
cxdsArgs
|
NULL
|
|
bcdsArgs
|
NULL
|
|
verb
|
FALSE
|
|
estNdbl
|
TRUE
|
|
force
|
FALSE
|
|
useAssay
|
counts
|
|
packageVersion
|
1.10.0
|
All parameters from the runBCDS and runBCDS
functions may be applied to this function in the cxdsArgs
and bcdsArgs parameters, respectively.
Ambient RNA
Detection
Ambient RNA Detection
Summary
decontX

DecontX
In droplet-based single cell technologies, ambient RNA that may have
been released from apoptotic or damaged cells may get incorporated into
another droplet, and can lead to contamination. decontX,
available from the celda,
is a Bayesian method for the identification of the contamination level
at a cellular level. The wrapper function runDecontX can be
used to separately run the DecontX algorithm on its own. The wrapper
function plotDecontXResults can be used to plot the QC
outputs from the DecontX algorithm. The outputs of
runDecontX are decontX_contamination and
decontX_clusters. decontX_contamination is a
numeric vector which characterizes the level of contamination in each
cell. Clustering is performed as part of the runDecontX
algorithm. decontX_clusters is the resulting cluster
assignment, which can also be labeled on the plot.
d8b737fb
DecontX Contamination
Score

DecontX Clusters

Density Score

Violin Score

Parameters
|
z
|
NULL
|
|
maxIter
|
500
|
|
delta
|
10 10
|
|
estimateDelta
|
TRUE
|
|
convergence
|
0.001
|
|
varGenes
|
5000
|
|
dbscanEps
|
1
|
|
logfile
|
NULL
|
|
verbose
|
TRUE
|
|
packageVersion
|
1.12.0
|
8628f96c
DecontX Contamination
Score

DecontX Clusters

Density Score

Violin Score

Parameters
|
z
|
NULL
|
|
maxIter
|
500
|
|
delta
|
10 10
|
|
estimateDelta
|
TRUE
|
|
convergence
|
0.001
|
|
varGenes
|
5000
|
|
dbscanEps
|
1
|
|
logfile
|
NULL
|
|
verbose
|
TRUE
|
|
packageVersion
|
1.12.0
|
e7372715
DecontX Contamination
Score

DecontX Clusters

Density Score

Violin Score

Parameters
|
z
|
NULL
|
|
maxIter
|
500
|
|
delta
|
10 10
|
|
estimateDelta
|
TRUE
|
|
convergence
|
0.001
|
|
varGenes
|
5000
|
|
dbscanEps
|
1
|
|
logfile
|
NULL
|
|
verbose
|
TRUE
|
|
packageVersion
|
1.12.0
|
84824920
DecontX Contamination
Score

DecontX Clusters

Density Score

Violin Score

Parameters
|
z
|
NULL
|
|
maxIter
|
500
|
|
delta
|
10 10
|
|
estimateDelta
|
TRUE
|
|
convergence
|
0.001
|
|
varGenes
|
5000
|
|
dbscanEps
|
1
|
|
logfile
|
NULL
|
|
verbose
|
TRUE
|
|
packageVersion
|
1.12.0
|
c47a5959
DecontX Contamination
Score

DecontX Clusters

Density Score

Violin Score

Parameters
|
z
|
NULL
|
|
maxIter
|
500
|
|
delta
|
10 10
|
|
estimateDelta
|
TRUE
|
|
convergence
|
0.001
|
|
varGenes
|
5000
|
|
dbscanEps
|
1
|
|
logfile
|
NULL
|
|
verbose
|
TRUE
|
|
packageVersion
|
1.12.0
|
SoupX
In droplet-based single cell technologies, ambient RNA that may have
been released from apoptotic or damaged cells may get incorporated into
another droplet, and can lead to contamination. SoupX uses
non-expressed genes to estimates a global contamination fraction.The
wrapper function runSoupX can be used to separately run the
SoupX algorithm on its own. The main outputs of runSoupX
are soupX_contamination, soupX_clusters, and
the corrected assay SoupX, together with other intermediate
metrics that SoupX generates.soupX_contamination is a
numeric vector which characterizes the level of contamination in each
cell. SoupX generates one global contamination estimate per sample,
instead of returning cell-specific estimation.Clustering is required for
SoupX algorithm. It will be performed if users do not provide the label
as input. quickCluster() method from package scran is
adopted for this purpose. soupX_clusters is the resulting
cluster assignment, which can also be labeled on the plot. The wrapper
function plotSoupXResult can be used to plot the QC outputs
from the SoupX algorithm. Plots includes a UMAP with clustering labels
and a number of UMAPs colored with the soup fraction of top marker genes
which are identified for contamination estimation.
d8b737fb
SoupX Clustering

Soup Fractions
ENSG00000127329

ENSG00000186335

ENSG00000169347

ENSG00000242029

ENSG00000251504

Parameters
|
useAssay
|
counts
|
|
bgAssayName
|
NULL
|
|
assayName
|
SoupX
|
|
tfidfMin
|
1
|
|
soupQuantile
|
0.9
|
|
maxMarkers
|
100
|
|
contaminationRange
|
0.01 0.8
|
|
rhoMaxFDR
|
0.2
|
|
priorRho
|
0.05
|
|
priorRhoStdDev
|
0.1
|
|
forceAccept
|
FALSE
|
|
adjustMethod
|
subtraction
|
|
roundToInt
|
FALSE
|
|
tol
|
0.001
|
|
pCut
|
0.01
|
|
reducedDimName
|
SoupX_UMAP_d8b737fb
|
|
sessionInfo
|
x86_64-pc-linux-gnu
|
|
cluster
|
soupX_clusters
|
8628f96c
SoupX Clustering

Soup Fractions
ENSG00000070915

ENSG00000074803

ENSG00000109684

ENSG00000183580

ENSG00000251504

Parameters
|
useAssay
|
counts
|
|
bgAssayName
|
NULL
|
|
assayName
|
SoupX
|
|
tfidfMin
|
1
|
|
soupQuantile
|
0.9
|
|
maxMarkers
|
100
|
|
contaminationRange
|
0.01 0.8
|
|
rhoMaxFDR
|
0.2
|
|
priorRho
|
0.05
|
|
priorRhoStdDev
|
0.1
|
|
forceAccept
|
FALSE
|
|
adjustMethod
|
subtraction
|
|
roundToInt
|
FALSE
|
|
tol
|
0.001
|
|
pCut
|
0.01
|
|
reducedDimName
|
SoupX_UMAP_8628f96c
|
|
sessionInfo
|
x86_64-pc-linux-gnu
|
|
cluster
|
soupX_clusters
|
e7372715
SoupX Clustering

Soup Fractions
ENSG00000143882

ENSG00000169344

ENSG00000127329

ENSG00000186335

ENSG00000070915

Parameters
|
useAssay
|
counts
|
|
bgAssayName
|
NULL
|
|
assayName
|
SoupX
|
|
tfidfMin
|
1
|
|
soupQuantile
|
0.9
|
|
maxMarkers
|
100
|
|
contaminationRange
|
0.01 0.8
|
|
rhoMaxFDR
|
0.2
|
|
priorRho
|
0.05
|
|
priorRhoStdDev
|
0.1
|
|
forceAccept
|
FALSE
|
|
adjustMethod
|
subtraction
|
|
roundToInt
|
FALSE
|
|
tol
|
0.001
|
|
pCut
|
0.01
|
|
reducedDimName
|
SoupX_UMAP_e7372715
|
|
sessionInfo
|
x86_64-pc-linux-gnu
|
|
cluster
|
soupX_clusters
|
84824920
SoupX Clustering

Soup Fractions
ENSG00000070915

ENSG00000127329

ENSG00000148942

ENSG00000183287

ENSG00000119121

Parameters
|
useAssay
|
counts
|
|
bgAssayName
|
NULL
|
|
assayName
|
SoupX
|
|
tfidfMin
|
1
|
|
soupQuantile
|
0.9
|
|
maxMarkers
|
100
|
|
contaminationRange
|
0.01 0.8
|
|
rhoMaxFDR
|
0.2
|
|
priorRho
|
0.05
|
|
priorRhoStdDev
|
0.1
|
|
forceAccept
|
FALSE
|
|
adjustMethod
|
subtraction
|
|
roundToInt
|
FALSE
|
|
tol
|
0.001
|
|
pCut
|
0.01
|
|
reducedDimName
|
SoupX_UMAP_84824920
|
|
sessionInfo
|
x86_64-pc-linux-gnu
|
|
cluster
|
soupX_clusters
|
c47a5959
SoupX Clustering

Soup Fractions
ENSG00000242029

ENSG00000074803

ENSG00000250799

ENSG00000251504

ENSG00000143882

Parameters
|
useAssay
|
counts
|
|
bgAssayName
|
NULL
|
|
assayName
|
SoupX
|
|
tfidfMin
|
1
|
|
soupQuantile
|
0.9
|
|
maxMarkers
|
100
|
|
contaminationRange
|
0.01 0.8
|
|
rhoMaxFDR
|
0.2
|
|
priorRho
|
0.05
|
|
priorRhoStdDev
|
0.1
|
|
forceAccept
|
FALSE
|
|
adjustMethod
|
subtraction
|
|
roundToInt
|
FALSE
|
|
tol
|
0.001
|
|
pCut
|
0.01
|
|
reducedDimName
|
SoupX_UMAP_c47a5959
|
|
sessionInfo
|
x86_64-pc-linux-gnu
|
|
cluster
|
soupX_clusters
|